OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature

Internal identifier: 000616 (Main/Exploration); previous: 000615; next: 000617

Authors: Pedro Coelho [United States]; Amr Ahmed [United States]; Andrew Arnold [United States]; Joshua Kangas [United States]; Abdul-Saboor Sheikh [United States]; P. Xing [United States]; W. Cohen [United States]; F. Murphy [United States]

RBID : ISTEX:AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4

Abstract

Slif uses a combination of text-mining and image processing to extract information from figures in the biomedical literature. It also uses innovative extensions to traditional latent topic modeling to provide new ways to traverse the literature. Slif provides a publicly available searchable database (http://slif.cbi.cmu.edu). Slif originally focused on fluorescence microscopy images. We have now extended it to classify panels into more image types. We also improved the classification into subcellular classes by building a more representative training set. To get the most out of the human labeling effort, we used active learning to select images to label. We developed models that take into account the structure of the document (with panels inside figures inside papers) and the multi-modality of the information (free and annotated text, images, information from external databases). This has allowed us to provide new ways to navigate a large collection of documents.

DOI: 10.1007/978-3-642-13131-8_4


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature</title>
<author>
<name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
</author>
<author>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
</author>
<author>
<name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
</author>
<author>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
</author>
<author>
<name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
</author>
<author>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
</author>
<author>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
</author>
<author>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-13131-8_4</idno>
<idno type="url">https://api.istex.fr/document/AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000684</idno>
<idno type="wicri:Area/Istex/Curation">000676</idno>
<idno type="wicri:Area/Istex/Checkpoint">000196</idno>
<idno type="wicri:doubleKey">0302-9743:2010:Coelho P:structured:literature:image</idno>
<idno type="wicri:Area/Main/Merge">000621</idno>
<idno type="wicri:Area/Main/Curation">000616</idno>
<idno type="wicri:Area/Main/Exploration">000616</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature</title>
<author>
<name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation>
<wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation>
<wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation>
<wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation>
<wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation>
<wicri:noCountry code="no comma">Joint Carnegie Mellon University-University of Pittsburgh Ph.D. Program in Computational Biology</wicri:noCountry>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
<affiliation wicri:level="4">
<country>États-Unis</country>
<placeName>
<settlement type="city">Pittsburgh</settlement>
<region type="state">Pennsylvanie</region>
</placeName>
<orgName type="university">Université Carnegie-Mellon</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4</idno>
<idno type="DOI">10.1007/978-3-642-13131-8_4</idno>
<idno type="ChapterID">4</idno>
<idno type="ChapterID">Chap4</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Slif uses a combination of text-mining and image processing to extract information from figures in the biomedical literature. It also uses innovative extensions to traditional latent topic modeling to provide new ways to traverse the literature. Slif provides a publicly available searchable database (http://slif.cbi.cmu.edu). Slif originally focused on fluorescence microscopy images. We have now extended it to classify panels into more image types. We also improved the classification into subcellular classes by building a more representative training set. To get the most out of the human labeling effort, we used active learning to select images to label. We developed models that take into account the structure of the document (with panels inside figures inside papers) and the multi-modality of the information (free and annotated text, images, information from external databases). This has allowed us to provide new ways to navigate a large collection of documents.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Pennsylvanie</li>
</region>
<settlement>
<li>Pittsburgh</li>
</settlement>
<orgName>
<li>Université Carnegie-Mellon</li>
</orgName>
</list>
<tree>
<country name="États-Unis">
<region name="Pennsylvanie">
<name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
</region>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<name sortKey="Ahmed, Amr" sort="Ahmed, Amr" uniqKey="Ahmed A" first="Amr" last="Ahmed">Amr Ahmed</name>
<name sortKey="Arnold, Andrew" sort="Arnold, Andrew" uniqKey="Arnold A" first="Andrew" last="Arnold">Andrew Arnold</name>
<name sortKey="Coelho, Pedro" sort="Coelho, Pedro" uniqKey="Coelho P" first="Pedro" last="Coelho">Pedro Coelho</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Cohen, W" sort="Cohen, W" uniqKey="Cohen W" first="W." last="Cohen">W. Cohen</name>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<name sortKey="Kangas, Joshua" sort="Kangas, Joshua" uniqKey="Kangas J" first="Joshua" last="Kangas">Joshua Kangas</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Murphy, F" sort="Murphy, F" uniqKey="Murphy F" first="F." last="Murphy">F. Murphy</name>
<name sortKey="Sheikh, Abdul Saboor" sort="Sheikh, Abdul Saboor" uniqKey="Sheikh A" first="Abdul-Saboor" last="Sheikh">Abdul-Saboor Sheikh</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
<name sortKey="Xing, P" sort="Xing, P" uniqKey="Xing P" first="P." last="Xing">P. Xing</name>
</country>
</tree>
</affiliations>
</record>
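
A record like the one above can be read programmatically. The sketch below is only an illustration, not part of Dilib: it assumes Python's standard `xml.etree.ElementTree`, works on an abridged copy of the record (`SAMPLE`), and uses hypothetical helper names (`parse_record`, `summarize`). Because the record uses the `wicri:` prefix without declaring it, a strict XML parser would reject it as is, so the sketch binds the prefix to a placeholder namespace URI (`urn:example:wicri`, an assumption) before parsing.

```python
import xml.etree.ElementTree as ET

# Placeholder URI (assumption): the record never declares the `wicri:` prefix.
WICRI_NS = "urn:example:wicri"

# Abridged excerpt of the record shown above, kept short for illustration.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader><fileDesc><titleStmt>
<title xml:lang="en">Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature</title>
<author><name sortKey="Coelho, Pedro" first="Pedro" last="Coelho">Pedro Coelho</name></author>
<author><name sortKey="Ahmed, Amr" first="Amr" last="Ahmed">Amr Ahmed</name></author>
</titleStmt>
<publicationStmt>
<idno type="doi">10.1007/978-3-642-13131-8_4</idno>
</publicationStmt></fileDesc></teiHeader>
</TEI>
</record>"""


def parse_record(text):
    """Parse a record after binding the undeclared `wicri:` prefix on the root."""
    patched = text.replace("<record>", '<record xmlns:wicri="%s">' % WICRI_NS, 1)
    return ET.fromstring(patched)


def summarize(text):
    """Return the title, author names, and DOI found in the TEI header."""
    root = parse_record(text)
    stmt = root.find("./TEI/teiHeader/fileDesc/titleStmt")
    title = stmt.findtext("title")
    authors = [name.text for name in stmt.findall("author/name")]
    doi = root.findtext(".//publicationStmt/idno[@type='doi']")
    return {"title": title, "authors": authors, "doi": doi}


if __name__ == "__main__":
    print(summarize(SAMPLE))
```

The same `summarize` call should work on the full record, provided its root element is written exactly as `<record>`; other serializations would need a different way of declaring the prefix.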

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000616 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000616 | SxmlIndent | more
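
When several records need to be displayed, the pipeline can be assembled from a script. The sketch below only builds the command string, reusing the `HfdSelect` options and the `SxmlIndent` filter exactly as they appear in the commands above. The function name `build_pipeline` and the example directory are hypothetical, and the sketch assumes nothing about Dilib beyond those two commands.

```python
import posixpath
import shlex


def build_pipeline(record_key, explor_step):
    """Build the shell pipeline that selects one record and indents its XML.

    record_key  -- internal identifier of the record, e.g. "000616"
    explor_step -- directory that contains biblio.hfd (the value of $EXPLOR_STEP)
    """
    hfd_path = posixpath.join(explor_step, "biblio.hfd")
    select = ["HfdSelect", "-h", hfd_path, "-nk", record_key]
    # shlex.join quotes arguments only when the shell would need it.
    return shlex.join(select) + " | SxmlIndent"


if __name__ == "__main__":
    # Hypothetical directory, used here only to show the resulting string.
    print(build_pipeline("000616", "/data/OcrV1/Data/Main/Exploration"))
```

The resulting string can be pasted into a shell or passed to `subprocess.run(..., shell=True)` on a machine where Dilib is installed.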

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:AE5DBE1CCB5FEE2907CEDC3DD02F2B7AFBA41CA4
   |texte=   Structured Literature Image Finder: Extracting Information from Text and Images in Biomedical Literature
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024